Investigating syllabic structures and their variation in spontaneous French

نویسندگان

  • Martine Adda-Decker
  • Philippe Boula de Mareüil
  • Gilles Adda
  • Lori Lamel
چکیده

The paper presents a study of syllabic structures and their variation in a large corpus of French radio interview speech. A further aim is to show how automatic speech recognition (ASR) systems can serve as a linguistic tool to consistently explore virtually unlimited speech corpora. Automatically selected subsets can be manually checked to accumulate knowledge on pronunciation variants. Our belief is that better formalised knowledge of variant mechanisms will ultimately contribute to improve pronunciation modelling and ASR systems. This study is meant to be a step in this direction. The linguistic phenomena we are particularly interested in, are sequential variants (i.e. variants with different numbers of phonemes) which may or not entail syllabic restructuring. These variants, frequent in spontaneous speech, are known to be particularly difficult for speech recognizers. To focus on sequential variants, a methodology has been set up using descriptions at the phonemic, syllabic and lexical levels. This study reports on a radio corpus composed of thirty 1-hour shows of interviews. Spontaneous speech is found to have a larger proportion of closed syllables than found in the canonical syllables derived from orthographic transcriptions. As expected, the optional schwa contributes to a large amount of variation in syllabic structure. Less well described phenomena are also observed, such as other vowels (/u/, /E/, /i/ and /a/) being deleted in a non-final (unstressed) position. Unstressed CV syllables, when preceded by an open syllable, are likely to undergo syllabic restructuring: vowel deletion together with backward onset-coda transfer. Complex syllables tend to be simplified: liquid consonants are often deleted, more often in coda than onset position. /v/ is the most deletion-prone consonant in both onset and coda positions. Finally, a substantial percentage of occurrences of word-final schwa syllables may completely disappear. Résumé Dans ce papier, nous traitons des structures syllabiques et de leur variation dans un corpus de parole en français issu d’entrevues radio-diffusées. Un des buts est de montrer comment des systèmes de reconnaissance automatique de la parole (RAP) peuvent servir d’outils linguistiques pour explorer de façon cohérente des corpus virtuellement illimités. Des sous-ensembles automatiquement sélectionnés peuvent être vérifiés manuellement pour accroître notre connaissance des variantes de prononciation. Notre conviction Preprint submitted to Elsevier Science 26 October 2005 est qu’une meilleure formalisation des mécanismes à l’oeuvre dans la parole contribuera en définitive à améliorer la modélisation des prononciations et les systèmes de RAP: cette étude se veut une étape dans cette direction. Les phénomènes linguistiques auxquels nous nous intéressons en particulier sont les variantes séquentielles (i.e. celles qui induisent un nombre variable de phonèmes), qui peuvent selon les cas conduire à une restructuration syllabique: ces variantes, fréquentes en parole spontanée, sont connues pour poser problème à la reconnaissance. Pour se focaliser sur elles, une méthodologie a été mise au point, utilisant des descriptions aux niveaux phonématique, syllabique et lexical Cette étude repose sur un corpus de parole de radio constitué de trente émissions d’une heure. La parole spontanée révèle une plus grande proportion de syllabes fermées que dans les syllabes canoniques dérivées des transcriptions orthographiques: comme on peut s’y attendre, le schwa optionnel contribue pour une grande part à la variation de structure syllabique. Des phénomènes, moins bien décrits ont également été observés : d’autres voyelles telles que /u/, /E, /i/ et /a/ peuvent tomber en position inaccentuée (non finale). Les syllabes CV non accentuées, précédées d’une syllabe ouverte, sont enclines à la restructuration : effacement de la voyelle et transfert attaque-coda. Les syllabes complexes tendent à être simplifiées : les consonnes liquides tombent souvent, plus en position de coda qu’en position d’attaque. Le /v/ est la consonne la plus facilement élidée indépendamment de sa position dans la syllabe. Enfin un pourcentage substantiel de syllabes faibles de fin de mot, ayant un schwa comme noyau, peuvent disparaître complètement.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating Syllabic Structure and Its Variation in Speech from French Radio Interviews

In this paper, we investigate syllabic structure and its variation in a corpus of French radio interview speech. The aim of this study is to relate sequential pronunciation variants, i.e. variants with different numbers of phonemes to syllabic restructuring. In French schwa and liaison are two well-known phenomena which allow for a variable number of phonemes. We first aim to quantify syllabic ...

متن کامل

Predicting vowel duration in spontaneous canadian French speech

This study examines variables influencing vowel duration of French spoken in Windsor, Ontario, in order to see whether their respective effects on vowel duration are organised hierarchically. We first consider the data distribution of four female speakers before carrying out a statistical principal components analysis. Our results show that the variables are classified into three underlying fac...

متن کامل

Towards Automatic Annotation of Temporal Features in Discourse: The Case of Syllabic Duration in Spontaneous French

Numerous discourse functions systematically resort to the prosodic resources found in the speakers’ management of temporal features. These resources concern pauses, syllabic duration modifications and speech rate. However, the description and modelling of temporal phenomena constitutes a particularly delicate endeavour, mainly because, contrary to intonation and intensity, they cannot be based ...

متن کامل

Prosodic Characteristics of Read and Spontaneous Speech in French

This study compares the read and spontaneous speech prosodic characteristics of two relatively small corpora in French (about 3 minute’s length). Acoustic data such as syllabic rate, number of effective stressed syllable vs. theoretical prediction, prosodic hierarchy and realization of melodic contours are compared for both styles. The predicting power of two theoretical approaches, autosegment...

متن کامل

The SpeakingInfluence of Style on Lexical f Profiles in French

This study presents a comparison of French lexical fundamental frequency (f0) profiles for different speaking styles using phonemic, syllabic and lexical transcriptions as well as partof-speech annotations. Three speaking styles (broadcast news, broadcast conferences and conversations) with over 20 hours of speech were used. Syllabic word length and POS were considered as influential factors. R...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Speech Communication

دوره 46  شماره 

صفحات  -

تاریخ انتشار 2005